Complete Phase Diagram of DNA Unzipping: Eye, Y-fork and triple point 



Rajeev Kapri^*, Somcndra M. Bhattacharjce^* and Flavio Seno^ 
^ Institute of Physics, Bhubaneswar 751 005, India 
^ INFM, Dipartimento di Fisica, Universitd di Padova, Via Marzolo 8,35131 Padova, Italy 

(Dated: February 2, 2008) 

We study the unzipping of double stranded DNA (dsDNA) by applying a pulling force at a fraction 
s (0 < s < 1) from the anchored end. From exact analytical and numerical results, the complete 
phase diagram is presented. The phase diagram shows a strong ensemble dependence for various 
values of s. In addition, we show the existence of an "eye" phase and a triple point. 
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The initial step in DNA replication and RNA tran- 
scription, as the enzyme associates with DNA, is to open 
a few base pairs near it. In case of replication this 
opening takes place near one of the ends, whereas for 
transcription it can be anywhere on the DNA0, H 0. 
The ubiquity of the process calls for a mechanism that 
does not require high temperatures or extreme pH con- 
ditions, unlike melting to which it generally gets associ- 
ated. One such possibility is a force induced unzipping 
transition^. This transition has now been well estab- 
lished both theoreticallyjljl H 0, B H ES IH III El 
and expcrimcntallvfl^ llSl llq. The focus of attention 
so far has been the geometry reminiscent of the DNA 
replication, pulling only the open end of dsDNA. How- 
ever, transcription requires pulling DNA at an interme- 
diate point, often with DNA getting anchored to cyto- 
plasmic membrane|3 in vivo . Similarly, end constraint 
is important in circular DNA as, e.g., in bacteria like E. 
Coli. Ancho ring of one end is also used in single molecule 
experiments [ij, The richer surprises in this type of 
geometry provide the primary motivation for working out 
the full unzipping phase diagram. 

The extensive theoretical workU S H 0, S i, E [H 
ll^ El on the unzipping transition in various avatars of 
the basic Poland and Scheraga model E3| the na- 
ture of the real phase diagramE3| recently obtained for 
lambda phage DNA indicate that the basic features are 
preserved in simpler exactly solvable models 0,H even in 
two dimensions. These basic results include the first or- 
der nature of the unzipping transition and the existence 
of a re-entrant region allowing unzipping by decreasing 
temperature. In this paper our aim is to study the force 
induced transition on the lattice model used previously 
in Ref. d but with the pulling force applied on a base 
pair which is A'^i monomers away from the ancorhed end 
of the DNA molecule (of total length N). We call s the 
fixed fraction Ni/N. See Fig. ^ In single molecule 
experiments, the results may depend on the statistical 
ensemble used0,0|. One may recall that instruments 
like atomic force microscopes |l5| use the fixed distance 
ensemble while the magnetic bead method of Ref E3| 
uses the fixed force ensemble. Therefore, we studied the 
unzipping transition both in the fixed force and the fixed 



distance ensembles, by using analytical and exact trans- 
fer matrix methods, though we concentrate mostly on the 
fixed force case in this paper. 
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FIG. 1: Schematic diagram of DNA unzipping by a pulling 
force at a fraction s (0 < s < 1) from the anchored end. In 
the fixed force ensemble the force g is kept fixed while the 
separation x is kept fixed in the fixed distance ensemble. 



Before describing the model, let us point out a few of 
the basic results we have obtained. The phase diagrams 
in both the fixed distance and fixed force ensembles are 
obtained. The qualitative features of the phase diagram, 
especially the nature of the phases, the s-dependence and 
the ensemble dependence are generic as can be shown 
by general arguments. An "eye" -like configuration exists 
for all s < 1 in the fixed distance ensemble, either as a 
distinct phase (when it extends upto the anchored end) or 
as two "Y" 's joined together (see fig. EJ where a "Y" is a 
coexistence of the unzipped and the zipped phases. These 
configurations resemble the "Y" fork in replication and 
the transcription bubble. In a fixed force ensemble, an 
eye exists only for low values of s (s < 1/2 in our model) 
and the phase diagram depends on s. For values of s 
where the "eye" phase exists, there is triple point at the 
intersection of the zippcd-eye (Zp-Ey), eye-unzipped (Ey- 
Uz), and zipped-unzipped (Zp-Uz) phase boundaries. 

The model is defined as follows. The two strands of 
a homo-polymer DNA are represented by two directed 
random walks on a d = 2 dimensional square lattice. 
The walks, starting from the origin, arc restricted to go 
towards the positive direction of the diagonal axis (z- 
direction) without crossing each other. There is a gain 
of energy — e (e > 0) for every contact (i.e. separation 
a; = 0). The directional nature of the walks takes care 
of the correct base pairing of DNA. In addition to this 
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bonding, a force g acts along the transverse direction (a;- 
direction) at a fixed fraction s (0 < s < 1) from the an- 
chored end (z = 0). As is well-known, this force, though 
acts at a point, affects the bulk behaviorQ. The quan- 
tities of interest depend on the ensemble one is working 
with. For example, it is the average separation {x) at 
the point of application of the force in the fixed force 
ensemble, whereas, in the fixed distance ensemble, it is 
the average force (17) needed to maintain the distance x 
between the two strands, (g, a;) constitute a thermody- 
namic conjugate pair. 

A taste of the surprise for s 7^ 1 can be gleaned from 
a simple analysis that is exact in the low temperature 
region. For s = 1, this argument gives the exact reentrant 
phase boundaryjilllil. If s < 1/2, at T = 0, the force 
opens the chain maximally so as to form an eye (extensive 
in length). The energy E and entropy S of the eye with 
respect to the completely zipped chain are E = —gasN + 
2esN and S ~ — 2siVln/ih, where fXb is the connectivity 
constant of the bound phase and a is a geometric factor. 
Throughout this paper we take the Boltzmann constant 
fcs = 1. For our lattice model, /J,f, = 2 and, we set a = 1 
by choosing the elementary diagonal of the underlying 
square lattice as the unit of length. A transition from the 
zipped (Zp) state is therefore possible if 5 > gc{s,T) = 
2(e-|-nn /ib)/a which is double the force required for the 
unzipping transition at s = 1. The situation is different 
for s > 1/2, where complete unzipping is possible at T = 
0. In the completely unzipped (Uz) state, the energy 
and entropy with respect to the zipped state are E = 
Ne — gasN and S = — N \n ^i, + 2{1 — s)N In fii where /ii 
is the connectivity constant for a single chain. The low 
temperature phase boundary is given by _gc(s,T) = (e + 
Tln/Xb//iJ^^ *'')/(sa). For our lattice problem ^1 = 2. 
Hence, there will be no low temperature re-entrance if 
s = 1/2. These s-dependences match the exact results. 

To trace out the exact phase boundary we use the re- 
cursion relation method. Let Dt{x,x') be the partition 
function with separations x and x' at the two extreme 
ends of a dsDNA of length t. Then, 

Dt+i{x,x') = [2Dt{x,x') + Dt{x,x' + 1) 

+Dtix,x' -l)]{l + P5^>,o),il) 

with P = — 1, (3 = 1/T, and initial conditions 
Dq(x,x') = Sx.x'- Mutual exclusion is ensured by 
D{x,x') ~ whenever any (or both) of the two argu- 
ments x,x' < 0. One can construct two other parti- 
tion functions, (i) dt{x) = Dt{0,x) when one end is held 
fixed, and (ii) dt{x) = '^^,Dt{x,x') when one end is 
free. Of these, dt{x) has been used in previous stud- 
ies of the force at the end caseQ where the phases and 
the transitions come from the singularities of the gener- 
ating function, Q{z,f3,g) = X^t ■^*6^^^dt(x). These 
singularities are zi ~ 1/4, Z2 = (2 -|- 2 cosh/Jg)"^ and 
Z3 = V— e"*^ 4- 1 — 1 + e^^. The zero force melting, corn- 
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FIG. 2: Fixed distance ensemble T vs x/{sN) phase diagram 
for (a) s = 0.2 and (b) s — 0.75 . The zipped and the eye 
phases are shown by thick lines. The coexistence regions are 
marked by different shades or vertical lines. Xc{T), defined 
after Eq. is represented by the dotted line in (b). 



ing from zi = zg, is at = l/ln(4/3) = 3.476059497.... 
The unzipping phase boundary can be determined in the 
fixed distance ensemble by noting that the force required 
to maintain the separation x is —Td\ndN{x)/dx. By 
using disiix) ~ \{z3Y /z^^"^ for large N with A(z) = 
(1 - 2z - Vl - 4z)/(2z), one gets 

(7,(T)=.ge(s = l,r) = -rinA(z3), {Zp^Uz). (2) 

This is the known s — 1 phase boundary]^ coming from 
Z2 = Z3 in the fixed force ensemble. On this bound- 
ary, the end separation is given by Xc{T) = x/N = 
ta,nh.[g c{T)/T]. The phase coexistence on this boundary 
gives the Y-fork structure, which we simply call a "Y" . 

Fixed distance ensemble: If the distance or sepa- 
ration of the two strands at t = sN is kept fixed at x, 
while the DNA is anchored (x = 0) at i = but free 
at the other end at t ~ N, the partition function is {zs 
dependence of A suppressed) 

Zn{x,s) = dsN{x)d(^i_s)Nix) 

« A^zg-^^ (4(1-^)^ + A^Z3-(^-^)^) . (3) 

In the limit N 00 for a fixed s, the larger of the two 
terms (in the big parenthesis on the right hand side) con- 
tributes to the free energy. For 



— X( T)- '^^^ ln(4z3) 
sN s ln(A(z3))' 



(4) 



the larger term is the second one, otherwise it is the first 
one. Therefore, we get a phase boundary 

5e(s,r) = 25,(T), if EqH (Zp ^ Ey) (5a) 
= 5c(T), otherwise, (Zp<4>Uz). (5b) 

With the increase of the separation x, the end point gets 
detached at the critical value x = sNX{s,T), provided 
X{s,T) < 1. Once all the bonds are broken the two 
open tails behave like free independent chains. In such 
a situation, the force required to maintain the separa- 
tion is just like the s = 1 end case (in the sN — > 00 
limit) as we see in Eq. I5al For this to happen we also 
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FIG. 3; (a) Scaled separation x/{sN) versus force isotherm 
for different s at T = 1.0 with A'^ = 256 in the fixed distance 
ensemble. The location of X{s,T) (see text) is shown by 
the solid line, (b) Average separation (fixed force ensemble) 
between the i*'' monomer of the DNA of length A'^ = 64 for 
T = 1.0 for a force g = 2.7 at s = 0.25 at two different 
temperatures. 



require X{s,T) < Xc{T), else the DNA will be in the 
unzipped phase. Fig. |21 shows the phase diagram on a 
temperature-distance plane for two values of s in the fixed 
distance ensemble, though in the force-temperature plane 
the phase boundaries are independent of s as follows from 
Eqs. I5al5bl An "eye" of the type shown in Fig ^ occurs 



in the coexistence region shown by solid vertical lines 
in Fig. 121 and is to be interpreted as two "Y" 's joined 
together. This configuration is analogous to the tran- 
scription bubble produced e.g., by RNA polymerase^ 
a subunit of which keeps the two strands of DNA sepa- 
rated. 

To supplement the exact results, the partition func- 
tion for the two strands starting from origin is obtained 
numerically by using the exact transfer matrix technique 
for the recursion relation Eq. In the fixed distance 
ensemble, the distance between the two strands x is var- 
ied at a step of 1 (length of diagonal of the square lattice) 
from ("zipped") to sN ("completely stretched"). The 
quantity of interest, the average force required to main- 
tain the distance x between two chains, is calculated by 
using finite differences in free energy. The scaled sepa- 
ration between the two strands of DNA, x/sN, versus 
the corresponding average force at T ~ 1.0 is shown in 
fig. for several values of s. Fig^ shows the eye for- 
mation in the fixed force ensemble and is discussed next. 
When the end monomers of the chains (s = 1.0) are main- 
tained at a fixed distance, there is only one plateau at the 
critical force given by Eq. ^ (the Zp-Uz phase bound- 
ary). When s < 1.0, the force-distance isotherm has two 
plateaus as per Eqs. (|5a|) . (|5bp with a step that matches 
with the critical value X{s,T). We do not go into further 
details of this phase diagram here because the subtleties 
are more prominent in the fixed force ensemble. 

Fixed force ensemble: In the fixed force ensem- 
ble, the generalization of the generating function de- 
fined for the end case below Eq. ^ is Qs{z,(3,g) = 
Y^x>of^'^^''J2t^*^tix,s). Using Eq. @ , this can be writ- 



ten as 

Gsiz, f3,g)^Yl e''^^{[A(4(i-^)/^zi/^)]- -t- [Xiz)^}, (6) 

where the first term on the right hand side in curly 
bracket represents the unzipped state while the second 
term is for the eye state. The additional singulari- 
ties in z are then Z2 ~ [4-'^~'*[2(l + coshf]g)Y]^^ , which 
goes over to the Z2 mentioned earlier for s = 1, and 
Z4 = {2[1 + cosh(/3_g/2)]}^^. The zipped to unzipped 
phase transition occurs at Z2 = z^ while a zipped to eye 
phase transition takes place at 23 = 24. 




FIG. 4: The gc versus T phase diagram in the fixed force 
ensemble. Lines are the exact results while the points are 
from numerics, (a) For 0.5 < s < 1. The inset shows the 
re-entrant region for s — 0.5. (b) For s = 0.35 and s — 0.25. 
The inset shows the triple point for s — 0.35. 

The s dependence of the singularities show that there 
cannot be an eye phase in the fixed force ensemble if 
s > 1/2, even though one may open an eye of the type of 
Fig. ^in the fixed distance ensemble. In this situation, 
only transition possible is unzipping with an s-dcpendcnt 
boundary given by 



5e(s,r) = TlnA(4(i 



) (Zp^Uz) (7) 



which matches with gdT) for the end case. In addition, 
close to T = 0, we see g^s, T) « | -|- In 2 which cor- 
roborates the results from the simple argument, including 
the vanishing slope at T = for s = 1/2 ( the absence 
of a low temperature re-entrance). The inset in Fig. 0Ji 
shows that there is still a small region in the intermediate 
temperature range where one does see a re-entrance. 

With a force at a point s < 1/2, a phase boundary 
comes from Z3 = 24 which matches with the boundary 
we already derived in Eq. (|5a() . This is the transition to 
eye in presence of a force, as found in the fixed distance 
ensemble also. However the eye phase cannot continue 
for the whole temperature range, definitely not at the 
melting point where the unzipped phase should be recov- 
ered. The unzipped phase boundary comes from Z2 = Z4 
which yields Eq. |(7|l. The lower of the two curves would 
determine the thermodynamic phase boundary. See Fig. 
^ These results also suggest the possibility of an eye to 
unzipped phase transition which however eludes the ap- 
proximation done in Eq. © (based on Eq. ©). This 
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phase boundary is determined numerically below. The 
intersection of the three boundaries, which occurs only 
for s < 1/2, is a triple point. 

To extend the analysis for s < 1/2, and to verify the 
analytical results, we use the exact transfer matrix tech- 
nique, mentioned already, but now with an applied force. 
The average separation (x) between the two chains at the 
site of application of force at a temperature T, is calcu- 
lated by taking the finite difference of exact free energies 
as g is increased in steps of = 0.001. The critical force 
in the thermodynamic limit is determined by iV — > cx) ex- 
trapolation of the crossing points of (x) versus g curves 
for pairs of length N. We take N from 400 to 1000. The 
results are shown in Fig. 01 with very good agreement 
with the analytical results. 

The maximum separation between the chains at the 
point of application of the force is sN, and at this sep- 
aration, the end monomers are always in an unzipped 
state for 0.5 < s < 1 even for T 0. This is not so for 
s < 0.5. In this case, the end monomers will of course be 
unzipped at higher temperatures but that does not rule 
out the possibility of a zipped phase at low temperatures. 
To study how the end monomer separation behaves with 
T when s < 0.5, the force is fixed to a value which lies on 
the phase boundary obtained above and T is increased. 
At low temperatures, we find that the end monomers are 
in zipped state (i.e. in contact) even when the separa- 
tion between the two chains, where force is applied, is 
maximum. At a particular temperature, which depends 
on the fraction s, the end monomer separation becomes 
macroscopic, signaling an Ey-Uz phase transition. Fig. 
0] shows the phase boundary obtained by repeated appli- 
cation of this procedure.. 

The complete phase diagram for s = 0.35 and 0.25 are 
shown in Fig. 2Jd. The three coexistence lines meet at 
a triple point Tp which shifts with s. The triple point 
moves towards the high temperature side when s is de- 
creased and merges with the melting point for s ~ 0.25. 
The inset shows the details around the triple point. The 
limit s — !■ is singular because the eye phase cannot exist 
and a force applied at the anchored point cannot open a 
chain. In that limit, the Ey-Uz boundary becomes verti- 
cal (parallel to the force axis) and is the only meaningful 
phase boundary. 

The Ey-Uz transition can be seen in Fig. ^jp which 
shows the average separation between the monomers of 
the DNA of length iV = 64 with a force at s = 0.25. We 
calculate the separation for at two different temperatures, 
both at g = 2.7 which lies on the phase boundary of the 
zipped and the eye phases at T = 1, and another for 
the same force but at T ~ 2 deep in unzipped region (see 
fig. 21). The monomers in the outer most part of the DNA 
are in the zipped state for T = 1 and g ~ 2.7 showing 
the formation of the eye but for T = 2 the DNA is in the 
unzipped phase. In the limit of A'^ — > oo the number of 
bound pairs at the end of the chain is extensive (oc N) 



in the eye phase but not in the unzipped phase. 

We may now summarize some of the results of this pa- 
per. (1) The phase diagrams in the fixed distance and 
fixed force ensemble are shown in Figs. El and 01 which 
also show when the "eye" appears as a distinct phase or 
as two "Y" 's. (2) There is a low temperature re-entrance 
in the fixed force ensemble for all values of s except for 
s = 1/2. In the latter case the re-entrance is restricted to 
a small intermediate temperature range. (4) In the fixed 
force ensemble, the phase boundary shifts with s as it is 
decreased from s = ltos = l/2. In this range there is no 
triple point. (5) For s < 1/2, the low temperature phase 
boundary representing the zipped-to-eye phase transition 
in the fixed force case is independent of s and it inter- 
sects the s-dependent zipped-to-unzipped boundary at 
a triple point. There is an additional eye to unzipped 
phase transition line in the large force regime. (6) The 
triple point shifts towards the zero-force melting point 
Tc as s is decreased. Although our approach is based on 
a coarse grained model, we believe that our results are 
robust to be observed by high precision measurements of 
DNA unzipping under a pulling force. 

We thank Sanjay Kumar for suggesting the relevance of 
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